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ABSTRACT 

Aims. We present a method of selection of 24 ^m galaxies from the AKARI North Ecliptic Pole (NEP) Deep Field 
down to 150 ^Jy and measurements of their two-point correlation function. We aim to associate various 24 ^m selected 
galaxy populations with present day galaxies and to investigate the impact of their environment on the direction of 
their subsequent evolution. 

Methods. We discuss using of Support Vector Machines (SVM) algorithm applied to infrared photometric data to 
perform star-galaxy separation, in which we achieve an accuracy higher than 80%. The photometric redshift information, 
obtained through the CIGALE code, is used to explore the redshift dependence of the correlation function parameter 
(ro) as well as the linear bias evolution. This parameter relates galaxy distribution to the one of the underlying dark 
matter. We connect the investigated sources to their potential local descendants through a simplified model of the 
clustering evolution without interactions. 

Results. We observe two different populations of star-forming galaxies, at Zmed ~ 0.25, z^ed ~ 0.9. Measurements of total 
infrared luminosities (Ltir) show that the sample at z^ed 0.25 is composed mostly of local star-forming galaxies, 
while the sample at Zmed ~ 0.9 is composed of luminous infrared galaxies (LIRGs) with Ltir ~ Lq. We find 

that dark halo mass is not necessarily correlated with the Ltir. for subsamples with Ltir = at Zm^d 0.7 

we observe a higher clustering length (ro = 6.21 ± 0.78 [/i“^Mpc]) than for a subsample with mean Ltir = 
at «med 1.1 (j'o = 5.86 ± 0.69 /i“^Mpc). We find that galaxies at 0.9 can be ancestors of present day L« early 

type galaxies, which exhibit a very high ro ~ 8 /i“^Mpc. 

Key words, infrared: galaxies - infrared: stars - galaxies: fundamental parameters - galaxies: statistics 


1. Introduction 

The connection between the properties of galaxies, such 
as morphology, luminosity, colour, surface brightness, 
or specific star formation rate (SFR), and the lo- 


mented (e.g., Marinoni et al.l 19991: iMarinoni et al. 20021: 

Hogg et al.l 2004 iWeinmann et al.ll2006l: 

Hirschmann et al.l 

20131: Boselli et al. 2014: Guglielmo et al. 

2015fl over a wide 


to influence the determination of different galaxy prop¬ 
erties. However, intrinsi c physical pro cesses, such as su¬ 
pernovae feedback (e.g., 5oissieill201,3li or a central black 


range of redshifts. Different environments, from rich clus¬ 
ters to very low density areas, can have a significant influ¬ 
ence on formation and a subsequent evolution of galaxies. A 
rang e of environmental mecha nisms, such as galaxy merg¬ 
ers (Toomre fc Toomrd I1972D. harrassment (|Moore et al.l 


11996 1. and gas stripping ( Gunn fc Gottl[T97^ are expected 


pernovae t eedback (e.g.. lljoissiei1l2()i;ill or a central black 
hole (e.g., iGhoi et ^1201511 . could be essential in putting 
a galaxy into a specific evolutionary path. One main ques¬ 
tion remains unanswered: are the observed dependencies a 
product of a cumulation of many processes over the cos¬ 
mic time, or were they predetermined at the time of first 
galaxy assembly. One of the approaches that can address 
these issues involves studying th e statistical correlation be¬ 
tween the fluctuations of mass (jKaiseil 11984 iMo fc White! 
119961 : l^rdeen et ^Il986ll . In this context, one way to shed 
light on the nature of the link between the baryonic com¬ 
ponent and the underlying matter distribution is to explore 
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the properties of a specific type of galaxy, and relate them 
to the parent dark matter halo (DMH). 

As the hierarchic al model of structure formation (e.g., 
iGranato et al.l 1200011 predicts, it is expected that galaxy 
formation, as well as star formation, is intensively driven 
by mergers. In this case, the first bursts of star forma¬ 
tion would occur in the denser regions of the Universe 
before appear ing in the less dense and therefore less clus¬ 
tered regions (lElbaz fc: Cesarskvll2003ll . Studies of the cos - 
mic infrared background radiation (e.g., iHauser et al.lll99^ 
have shown that at least half of the total observable 
energy generated by stars has been absorbed by dust 
and reprocessed into infrared light. This result has indi¬ 
cated that the dust obscured star formati on must have 
been much stronger at earlier epochs (e.g., iLagache et al.l 
I 2 OO 5 I: llVanceschini et al.ll2008ll . Many authors report that 
at increasing redshifts, extreme star forming (SF) galax¬ 
ies, which bec ome more prominent , are heavily ob s cured 
by dust (e.g., Hogkins.et_a]J_ 20011: iTakeuchi et al.l 120051 : 


iBuat et ^120011 ISullivan et al.ll20^. 

Observations with infrared (I R) satellites like 
Infra red Space Observatory (ISO, iGenzel fc Cesarsk^ 
2000[) and the Infr ared Astronomical Satellite (IRAS; 
Lonsdale fc Hackind Il989tl . have revealed the existence 


of thousands of galaxies that emit strongly at IR wave¬ 
lengths. With the advent of a more recent generation 


of IR satellites like the Spitz( 
iPanovich et al. 2004 Dole et al. 

Space Telescope (e.g., 
2004' iFraver et al.l 1200611 

and AKARI ([Matsuhara et al.| 

2006a). much larger and 


deeper samples of IR. sources (|Le Floc’h et al.ll200,^ with 
improved sensitivities were provided. 

As star formation is an extremely important factor in 
galaxy evolution, it is of crucial importance to study the 
properties of galaxies that are actively forming stars. Of 
particular interest are mid-infrared (MIR) observations, as 
they trace the dust emission resulting from the heating by 
high-energy photons, and therefore they are invaluable in 
detecting sources of actively forming stars. The MIR se¬ 
lection is affected by prominent spectral features. These 
features include strong emission from the polycyclic aro¬ 
matic hydrocarbons (PAH) at rest-frame wavelengths of 
3.3, 6.2, 7.7, 8.6, II.3, 12.7, 16.3, and 17 /xm, which can in¬ 
crease the number of sources within a flux-limited sample, 
or silicate absorption feature at 9.7 /xm, which can cause 
a deficit in detection of the targeted sources. Obtaining 
knowledge about the way that different SF galaxy popula¬ 
tions are placed in the context of the large scale structure 
(LSS) of the Universe can help to understand their origin 
and determine their future fate. 


So far, in the field of clustering of mid-IR sources, many 
authors have relied on the data collected through Spitzer’s 
24 fim band, mainly because those wavelengths can pro¬ 
vide a broad view of SF sources at different redshifts. The 
published results provided great insight into the connec¬ 
tion between the underlying dark matter distribution of IR 
galaxies and their evolution: 


— iGilli fc Daddil (l2007ll presented spatial clustering mea¬ 
surements of ~800 galaxies from SWIRF MIPS 24 /xm 
passband, observed at z ^ 1. They found that depend¬ 
ing on the different cuts of the total infrared luminos¬ 
ity (Ltir) galaxies cluster in a different way: objects 
with higher Ltir cluster with greater strength. This 
implies that galaxies with higher star formation activ¬ 


ity are hosted by more massive DMHs and denser envi¬ 
ronments than currently predicted by galaxy formation 
models. _ 

— iMagliocchetti et ^ (l2008ll used ~ 1040 SWIRF MIPS 
24 /xm selected sources brighter than 400 ^Jy from two 
redshift intervals: at z = [0.6 — 1.2] and z > 1.6, to 
discover, that both populations are highly clustered. 
The high-z population was found to reside in very mas¬ 
sive halos (comparable with local hosts of groups-to- 
clusters), while the low-z population resides on average 
in smaller DMH. Moreover, from the IR photometry it 
was revealed that both samples contain a similar mix¬ 
ture of star-forming galaxies and active galactic nuclei 
(AGN). 

— IStarikova et al.l (120121) have analyzed properties of ~ 
20000 objects above a 310^Jy flux limit in MIPS 24 /xm 
passband. They reported the existence of two popula¬ 
tions of IR galaxies at Zmed 0.7 and z^ed ^ 1-7- They 
reveal that both samples represent different populations 
of objects, which are found in differently sized DMHs, 
with the higher-z objects residing in progressively more 
massive halos. 

A general conclusion throughout the literature is that 
24 /xm sources are composed of two distinct populations 
of objects inhabiting different structures: the z ~ 1 dust- 
enshrouded SF galaxies and very massive systems at z ~ 
2 where the active phase has ceased at lower redshifts. 
However, all of the above Spitzer studies differ amongst 
each other in many aspects, such as the sample selection 
criteria (mostly based on the existence of counterparts in 
optical wavelengths), different methods of redshift estima¬ 
tion, and drastically varying surface density of the sources. 
In light of those facts, it would be advisable to take an in¬ 
dependent look at the 24 /xm sources, from a perspective of 
another satellite, with a source selection method, which is 
not based on the existence of counterparts in optical wave¬ 
lengths. 

The AKARI satellite was designed to carry out IR ob¬ 
servations with a sensitivity and resolution higher than that 
of preceding missions. It was launched by JAXA’s MVS ve¬ 
hicle on February 22, 2006, carrying out, amongst others, 
a deep survey of the North Ecliptic Pole region (hereafter 
NEP), which we aim to use to explore the mid-infrared 
properties of galaxies, in particular, the evolution of clus¬ 
tering. With a dense IR wavelength coverage, AKARI is 
well suited for delivering an independent look at the evo¬ 
lution of composition and clustering of different IR galaxy 
populations. In this research, we study the clustering prop¬ 
erties of 24 /xm galaxies from a perspective of an infrared 
satellite other than Spitzer, utilizing a new source selection 
method based on the usage of IR data alone. 

The paper is organized as follows: the description of 
the data, the method used to select the galaxies, and the 
way the redshifts were obtained can be found in Sect. 2; in 
Section 3 we present and summarize the methods used to 
calculate the angular correlation function and galaxy mass 
bias. Sect. 4 presents the results of application of those 
methods to the NEP data and we discuss and compare our 
findings with previous studies in IR and optical surveys. 
The summary and conclusions are given in Sect. 5. 

Throughout this analysis we have assumed a fiat A 
CDM cosmology with Dm = 0.27, Da = 0.73 and HqIOO h 
km Mpc~^. 
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2. The data 

The NEP Deep sky survey cove rs an area of 0.4 sq. de g 
around the North Ecliptic Pole (iMatsuhara et al]l2006bll . 
The data were ob tained by the Infrared Camera (IRC; 
lOnaka et al.l l2007ll through nine near- and mid-infrared 
(NIR and MIR) filters, centred at 2 /rm {N2), 3 ^m {N3), 
4 ^m (A^4), 7 fj,m (57), 9 /rm (59IT), 11 fim (511), 
15 /im (T15), 18 /rm (L181T), and 24 ^m (T24), where 
W indicates that the bandwidths are wider than the oth¬ 
ers. The point spread function (PSF) has a beam size of 
^ 5 arcsec (depending on the wavelength it varies be¬ 
tween 4.4 and 5.8), which makes AKARTs imaging supe¬ 
rior to that of other IR satellites. The source extraction 
on FITS images was do ne using the SExtractor software 
(jBertin fc Arnout^ Il996ll . A source is assumed to be de¬ 
tected if it has a minimum of 5 contiguous pixels above 
1.65 times the RMS fluctuations. Instead of allowing the 
program to estimate the b ackground, weight maps were 
used Isee IWada et al.l [200811 . Photometry was carried out 
using SExtractor’s MAGAUTO variable elliptical aperture 
with aperture parameters: Kron factor and minimum ra¬ 
dius set to 2.5 and 3.5, respectively. The zero magnitude 
points were derived from observations of standard stars 
(jTanabe et al.ll20d^ and are used to convert counts to mag¬ 
nitude by the photometry program. The number of sources 
detected in individual filters differs significantly: far more 
sources are detected in NIR than in MIR. The photometry 
resulted in detection limit of 19.3 /iJy at 24 /rm (L24 fil¬ 
ter). The results of this procedure were downloaded from 
the official AKARI Researchers Web Pag43, however, we 
have conducted security check runs of SExtractor, and the 
parameters obtained from this independent run were used 
in the subsequent analysis, after confirming that the basic 
results were consistent with the original catalog. 

2.1. 24 iim galaxy sample selection 

The first attempt to separate stars and galaxies within the 
NEP data was presented in (ISolarz et al.ll20l'^ . where we 
exploited the Support Vector Machines (SVM) algorithm 
using all available color information for AKARI objects. 
Here, we employ the same method, this time aiming at 
selection of a pure 24 pm galaxy sample, which we use 
later on to explore the clustering properties of these kinds 
of objects. In this subsection,;1;1; we give a brief summary 
of the principles governing the selection method and then 
we describe how it was applied to obtain the best and most 
secure results. 


2.1.1. Method 


To separate stars and galaxies we employ the Support 
Vector Machines (SVM) algor ithm, a supervised method 
based on kernel methods (|Shawe-Tavlor fc Cristianlnij 
I2OO4II , allowing for pattern recognition within the provided 
data. The advantage that the SVM algorithm has over 
other algorithms is the ability to use all available infor¬ 
mation simultaneousl y, which has proven to be of great 
use in as tronomy ('e.g..[W ozniak et al.l 2004 : Zhang fc Zhaol 
2004 or [Huert a s-Companv et al.ll2008t ISolarz et al.f 2012t 


Malek et ahl [20f3ll . Here, we outline the basic idea be- 


http://www.ir.isas.jaxa.jp/ASTRO-F/Observation/ 


hin d the method, ho we ver, for a more in-depth descriptio n 
see lHsu et al.1 (l2003ll or ICristianini fc Shawe-Tavlorl (I2OOOII . 
The task of the SVM algorithm is to divide the data points 
into two (or more) subsets according to the previously cho¬ 
sen adumbrative samples of desired classes by a supervisor, 
and to create a separating hyperplane, which serves as the 
decision boundary for the data that we want to classify. 
To train the SVM algorithm means to input a feature vec¬ 
tor for each object in training example, i.e., quantities that 
describe the properties of a given class’ object, so we are 
mapping the input data from the input space X onto a 
feature space (which can consist of an infinite number of 
dimensions) H using a nonlinear function: (j): X ^ H. In 
the parameter space H the function that determines the 
boundary can be written as 

n 

f{x) = '^aik{x,x') + b, (1) 

i=l 

where k{x,x') is the kernel function returning an inner 
product of the mapped vectors, ai is a linear coefficient, and 
6 is a perpendicular distance, which translates the bound¬ 
ary in a given direction. 


2.1.2. Application to the NEP data 

In ISolarz et al.l (|2012ll . this method was directly applied 
to the NEP Deep sources with full photometric informa¬ 
tion. With the AKARI IRC flux measurements, we built 
a 6D parameter space using the following color indices: 
V2-V3, iV3-V4, iV4-57,57-511,5II-L15, LI5-LI8. 
We excluded the SW9 and L24 filters in order to classify 
as many objects as possible, since the amount of sources 
detected in those passbands was the smallest. Then, 
samples containing stars and galaxies, chosen by their 
stellarity parameter (sgc) value measured in NIR, were 
used to train the SVM and obtain its classifier. The 
results reached the total classification accuracy of 93 %, 
with specific accuracies of 98 % for selecting stars and 
90 % for selecting galaxies. The results were tested with 
the auxiliary optical identifications and with the source 
count models, all of which proved that the classification 
based on infrared data alone is very efficient. Therefore 
it was a natural next step to test it on the sources that 
were missing measurements due to the masking procedure 
and/or actual physical dropout properties. 

To create a catalog of 24 pm AKARI galaxies, we 
performed a typical four-step routine for the application of 
SVMs to the classification task, as follows. 

1. Manual selection of subsets of objects, which are 
representative to their predefined class, and serve as a 
training basis for creating a classifier. 

2. Each training source has to be described by its dis¬ 
criminating properties; in other words, for each training 
example there has to be a corresponding feature vector. 

3. Selection of the kernel function. In this research we only 
use the radial basis function. 

4. Training the algorithm to learn how to distinguish 
objects by creating a separation hyperplane described by 
two adjustable parameters (C, cr). 

To complete these steps, firstly, as a training sample we 
used previously classified stars and galaxies detected in all 
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the AKARI passbands. Then, we constructed the feature 
space from seven IR colors. Based on the training sources 
we created a classifier by performing a grid search of two 
free parameters (cr, responsible for the shape of the hyper¬ 
plane, and C, governing the amount of missclassifications 
for which we allow) and a tenfold cross-validation tech¬ 
nique. The final pair of parameters {{C,a) = (10°, 10“^)) 
was chosen based on the best accuracy of correctly clas¬ 
sifying stars and galaxies within the training sample. The 
accuracy is defined as a ratio of all galaxies and stars whose 
nature was properly recognized by the classifier to the to¬ 
tal number of considered objects. The final classifier was 
applied to all sources detected in the 24 fim band (2208 
object^ to infer their respective classes. 

The resultant classifier exhibited a 83.89 % total ac¬ 
curacy, with the true galaxy rate (TGR) and true star 
rate (TSR) for selecting stars and galaxies as 97.69 % and 
66.40 %, respectively. The specific accuracies are defined as 
follows: 


TGR = 


TSR = 


TG 

TG + FS' 
TS 

TS + FG' 


where TS stands for a true star (actual star identified as 
a star by SVM), TG is a true galaxy (actual galaxy iden¬ 
tified as a galaxy by SVM), TG is a false galaxy (actual 
star classified as a galaxy by SVM) and TS” is a false star 
(actual galaxy classified as a star by SVM). As the classifier 
was built with a primary objective to select galaxies, any 
objects with rough spectral shape that was too different 
from a galaxy was classified as a so-called star. That how¬ 
ever does not mean that such sources are actually stars. 
The primary criteria for classification of an object were not 
only infrared colo rs, but also measu rements of the sources’ 
compactness Isee lSolarz et al.ir2012ll . That is why in addi¬ 
tion to stars this sample could contain, for example, com¬ 
pact galaxies and AGNs. Therefore this group of objects 
shall be called a reject sample. As the class of those sources 
may be of a nature that we have not considered here, we 
left them out of the analysis to preserve the purity of the 
galaxy sample. The rejected objects’ number is 799, and, as 
stated before, this sample contains a mixture of stars and 
possible galaxies displaying unusual properties that have 
caused their displacement with respect to the separation 
hyperplane. 

The resultant galaxy sample contains sources that are 
almost always detected also in NIR-N passbands. This sug¬ 
gests that our sample does not include a higher redshift 
fraction of int ensively SF galaxie s expected at z ~ 1.6 and 
2.7 (see, e.g., iHouck et al.ll200^ . a population that as a 
result of high dust content is obscured in optical and near- 
infrared part of the spectrum. Nevertheless, as we report in 
the later sections, we find tentative evidence for a separate 
high redshift population of these kinds of galaxies. 

Estimation of the contamination of our samples is based 
on the number of objects belonging to the two classes that 
lie within the hyperplane boundary. If an SVM classified 
galaxy’s (or star’s) position in multicolor space has a sep¬ 
aration boundary distance smaller than the error bar, it is 
treated as a possible misclassification. On this basis, the 


® The number corresponds to the amount of objects left after 
masking unreliable regions of the image. 


contamination in our galaxy sample is estimated to be 4.97 
%. After a rejection of the boundary region, the final cata¬ 
log contains 1339 galaxies. 

Figure [1] shows positions of galaxies in the final 24-/rm 
selected catalog within the masked NEP region. The mask¬ 
ing procedure included the removal of bad columns, high- 
noise areas and overexposed bright objects (like the Cat’s 
Eye Nebula, which covered almost one-third of the field in 
24 /xm image.) 



268.0 268.5 269.0 269.5 270.0 

rec [deg] 


Fig. 1: Position map of the MIR galaxy sample chosen by 
means of the Support Vector Machines algorithm (blue 
points). Red points mark the shape of the mask of the field 


We estimate the completeness of the derived catalog 
empirically to be 150 /iJy, by defining the limit as a point of 
the maximum of the differential number counts of sources. 


2.2. Redshift estimation 

Since follow-up spectroscopic measurements are time con¬ 
suming and, for the AKARI NEP Deep Field, the existing 
measurements are very sparse (which is addressed further 
in the text), and we have to rely on the photometric estima¬ 
tion of the redshifts. The usual photometric redshift esti¬ 
mation codes are based on fitting the abundan ce of optical 
temp lates for sources, like the LePhare routine (jllbert et al.l 
I2006D . However, since the only measurements available for 
our sources are performed in the NIR and MIR, we had 
to approach this task in an alternative way. To this aim, we 
ran the Code I nvestigating GALaxy Emission (CIGALE; 
iNoll et all 1200911 ■ a spectral energy distribution (SED) fit¬ 
ting routine. CIGALE was not developed as a tool for esti¬ 
mation of photo-z but since it uses a large number of mod¬ 
els covering the whole spectrum including IR, it may be 
expected to provide ^photo using only the infrared part of 
spectra (|Malek et al.l 1201^ with satisfactory quality. The 
CIGALE technique uses models describing the emission 
from a galaxy in the wavelength range from far-UV to 
far-IR. Models o f e mission from stars are given eith er by 
iMarastoiil (l2005ll or [Fioc fc R.occa-Volmeranra (|l997ll . The 
absorption and scattering of star light by dus t, the so-called 
attenu ation curves for galaxies, are given bv ICalzetti et all 
(|200nh . Dust emission is characterized by a model proposed 
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by iDale fc Heloij (|2002f) . In this model, the IR part of an 
SED is given by a power-law distribution 

dMd(C/) oc (2) 

where Md{U) is the mass of dust heated by a radiation field 
{7, and osed is a heating intensity. 

To find a photometric redshift with CIGALE, it is nec¬ 
essary to shift a galaxy spectrum to many different red- 
shifts and then to run CIGALE to fit SEDs for the same 
galaxy with a number of combinations of different redshift 
values. We decided to estimate photometric redshift for the 
AKARI NEP sample in the redshift range from Zmin = 0.01 
to Zniax = 3.00 with a step 6z=0.02. The final redshifts as¬ 
signed by CIGALE are based on the minimal value of 
the fi t with 7 degrees of freedom (for details, see lNoll et al.l 
1200911 . In the same sample of 1339 galaxies, it was not pos¬ 
sible to estimate photometric redshift equal to 99.99) 
for only 4.85 % of the sources. Eor 38.31 % of the galaxies, 
the was lower than 10. 

The distribution of the is shown in Figure [21 The 
resultant redshift distribution obtained using GIGALE is 
presented in the Fig. 3, where it is compared to the other 
estimations previously published in the literature. The de¬ 
tailed description of the appli cation of CIGALE t o AKARI 
data is discussed at length in iMalek et al.l (l2014ll . 



Fig. 2: The distribution of the x^ value obtained by red¬ 
shift estimation analysis (shown in bins of 5.0). The vertical 
dashed line marks a cut at x^ = 10, corresponding to the 
best SED fits used later for the analysis. Sources with 
values equal to 99.99 are not shown. 


Previous studies of photometric redshift estimates of the 
24 /rm selected sources have identified two peaks in the 
redshift distribution. Depending on the flux density limits, 
many studies have reported a rise in the d etection of sources 
at z < 1 peak ing between z ^ 0.3 (e.g., Desai et al.ir2008ll 
and z ^ 0.8 (lP4rez-Gonzalez et al.l 120051 : Le Floc’h et al.l 
l200,5l : ICamiti et al.ll200fill . At these redshifts there are no 
prominent spectral features in the 24 /rm band except for 
the ^ 16 and 17 /jm PA H emission bands. Some authors 
(e.g., iGamiti et al.ll200^ have reported that between the 


redshifts of z ^ 0.8 to z ^ 1.6 the number of detected 24 
/rm objects decreases significantly, possibly because of the 
deep silicate absorption line features at 9.7 /rm rest frame 
causing s ome sources t o fall out of the flux detection limit. 
However, iDesai etlil] (1200811 report a tentative detection 
of a distribution peak at z ^ 0.9, which has not been de¬ 
tected in previous studies. This distribution peak can be 
attributed to the 12.7 /rm PAH feature and 12.8 /rm [Nell] 
emission line, which at this redshift is seen in the 24 /xm 
passband. 

An additional peak predicted by models (e.g., 
iLagache et al.l[200ll . which occurs at z ~ 2, can be at¬ 
tributed to sources dominated either by an AGN, or as 
a consequence of PAH emission features at ^ 8/xm (e.g., 
ICaouti et al.ll200^ . 

The results of our photometric redshift derivation using 
the GIGALE code reveals results that are in agreement with 
these studies (see Fig. 3). We observed two expected peaks 
in the photo-z distribution, one located at z ^ 0.6 and the 
weaker secondary peak located at z ~ 1.2. Moreover, we 
detect a noticeable rise in the counts at z ~ 2.4. 



0.0 0.5 1.0 1.5 2.0 2.5 3.0 


z 


Fig. 3: Redshift distribution of the AKARI NEP Deep Field 
MIR selected galaxies derived by GIGALE code (solid line) 
together with those pub lished in the literature (dashed: 


iMagliocchetti et ^ 


dotted: Desai et al 


120081 : d ash-dot-dashed and dash-triple- 


(|2008ll . for objec ts with and w i thout 
spectroscop ic redshifts: large-das hed: iGilli fc Daddil 120071 : 


spectroscop ic reasnitts: large-das tK 
and dotted: (Starikova et al.ll20i^ . 


iMurata et al.l (|2013ll recently released a catalog with 
measurements of spectroscopic redshifts for a sample of 
307 objects within the AKARI NEP Deep field obtained 
by DEep Imaging Multi-Object Spec trograph (DEIMOS) 
in Keck H telescope (iFaber et al.ll2003l : Goto et al. in prep). 
Since the best way to estimate the accuracy of the photo-z 
measurement procedure is to check it with directly mea¬ 
sured spectroscopic redshifts, we h a ve cro ss-correlated our 
catalog with that of iMnrata et al.l (|2013ri . We have found 
149 counterparts from the spectroscopic catalog. Six of 
them resulted in GIGALE finding no believable redshift 
(x^ > 99). To check the reliability of the photometric esti¬ 
mations we estimate the a mount of the catas trophic errors 
(CE) for sources, following lllbert et al.l (|20n9ll . A CE occurs 
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when the value of 

^ ^ |zspec - (photo-z)l 

1 2sp0c 

exceeds 0.15. We estimate this value for several limiting 
cuts to determine the value that is the most reliable for 
a clustering analysis. Figure |4] shows the comparison be¬ 
tween the derived photometric redshifts and spectral mea¬ 
surements, whereas Tabled] presents the results of this pro¬ 
cedure. The amount of the CE reaches 18.37% for < 10 
and then increases significantly with the increase of the 
value until it reaches a value of almost twice as high for the 
< 70. 

Table 1: Comparison of zcigale vs Zgpec as a function of 
X^- V ~ percent of CE in the sample. 



Ngal 

Ncb 

»7[%1 

70 

126 

44 

34.92 

50 

112 

39 

34.82 

30 

73 

21 

28.77 

20 

65 

18 

27.69 

10 

49 

9 

18.37 
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Fig. 6: The redshift distribution of the AKARI NEP Deep 
sources obtained by the CIGALE code as a function of the 
limiting x^ value. Distribution of all sources, with a suc¬ 
cessfully calculated Zphoto is represented by the solid line, 
distribution of sources with photometric redshift measure¬ 
ments on the level of x^ < 10 by dash-dotted line, and 
X^ < 70 by dash-triple-dotted line. 



Fig. 4: Comparison between zcigale and Zspectro- Different 
symbols correspond to different x^ values: blue crosses de¬ 
note objects with x^ < 10, black asterisks objects with 10 < 
X^ < 70, and orange squares objects with 70 < x^ < 99. 
Solid lines are for zgigale — .2:spectro 0.15(1 F Zapectj-oj. 


In Fig.[S|we present the examples of the resultant SEDs 
obtained with different x^ values. It is clear that for values 
from x^ 40 up to the 70 we are still dealing with the 
very good fits to the data. The fits with values of x^ 80 
no longer seem reliable. The redshift distributions of the 
sources included within a specific x^ cut are presented in 
Fig. ini For the most reliable results, we decided to replace 
the photometric redshifts for objects that have a spectro¬ 
scopic measurement from the Murata et al. catalog, and as¬ 
signed them the value of x^ = 0- This procedure increased 


the number of sources from 511 to 611 for the x^ < 10 sam¬ 
ple, from 1195 to 1220 for the x^ < 70, and from 1268 to 
1274 for the full catalog without the galaxies for which the 
CIGALE code was unable to retrieve the redshift. 


3. The angular correlation function 

Galaxy clustering is traditionally analyzed in the first place 
by the means of the two-point correlation function, ^(r), 
which is treated as an above random excess probability of 
finding a galaxy within a certain distance r from another 
galaxy. Often it is approximated by a power law ^(r) = 


3.1. Estimator 

To compute the angular correlat ion function, we use th e 
following estimator introduced bv iLandv fc Szala^ (Il993ll : 

DD(e) - 2DR(e) + RRW 

=- ms) - ' ’ 

where DD(0) is the number of galaxy-galaxy pairs, RR(6*) 
is the number of random-random pairs, and DR(0) is the 
number of galaxy-random pairs within an angular bin of 
separation on the sky 6. 

We use 10^ homogeneously generated random points 
within the FOV and overlay the photometric mask with the 
identical features as the real FOV. Because of the masking 
of bad sectors of the FITS file for the 24 /im passband, the 
effective number of galaxies used for the calculation was 
signihcantly reduced (by ^ 36 % of the original number 
of sources). The correlation function was computed in the 
angular bins of width <51og0 = 0.1. 
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Fig. 5: Examples of the CIGALE fitted SED’s to the AKARI NEP Deep data for different values. Each plot has a 
close-up on the photometric data in the linear scale. 


3.2. Errors 


out. Then, the errors were estimated from Eq. [i 


The uncertainties arising from computation of the correla¬ 
tion function through the Landy-Szalay estimator are 


5w{9) 


1 -I- uj[0) 

x/DD(^’ 


(5) 


which accounts for the standard deviation in the pair count¬ 
ing along with the intrinsic variance. However it does not 
include the error arising from the covariance of the correla¬ 
tion function at different separations. There are a number of 
ways to avoid this problem; f or instance, using b ootstrap 
resampling of the data (e.g., Barrow et al.iri984l ), mock 
catalogs (e. g..lPollo et al.ll20 0^. jackknife resampling (e.g . , 
iGaztana^ 11994 iRoss et al.l 120071 : IScranton et al.1 12002 1. 


and many others. We employ the jackknife resampling of 
the data to calculate errors. To this aim we divided the ob¬ 
served field into ten equally sized areas and computed the 
correlation function ten times, each time leaving one area 


10 

( 6 ) 

i=l 

where i denotes a particular subsample iteration. We calcu¬ 
lated all presented uncertainties from here on in this man¬ 
ner. 

3.3. Fitting procedure and integral constraint 

Usually a correlation function follows the power law 

a;(0)=A^(0i-^), (7) 

le.g.. lPeeblejll994l . Every estimate of uj{9) is penalized by 
the finite size of the sur veyed area and referred to as the ” in¬ 
tegral constraint” HG. [Peebles fc Grotblll976h . This effect 
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causes a negative offset of the observed uj{0) with respect 
to the true correlation function, by a constant 


IC 


J j a;T(0)df7idO2, 


where dfli and dn 2 are the transverse solid angles of each 
pair. It can be approximated using the following formula: 




( 8 ) 


introduced by iRoche &: EalesI (Il999ll . After applying this 
correction, the correlation function can be written as 
u}{9) = — IC). To determine the best-fit param¬ 

eters ( 7 , A), we performed a minimization including the 
full covariance matrix C, i.e., 


= [02(0) - - IC)]TC-i[w(0) - - IC)]. 


3.4. Limber inversion and space clustering 

Using the measurements of the angular clustering, we 
can infer the three-dimensional clustering properties based 
on the known redshift distrib ution via Limber’s equation 
(jPeeblesI 119801 iLimbed 1195,811 . When dealing with small 
scales both angular and spatial co rrelation functions are 
well described by power laws (e.g., iDavis fc Peebles! Fl 98311 
and t herefore Limber’s equa tion can be presented in the 
form (jEfstathiou et al.lll99lll . 


A ri l(z)(dN/dz)2dz 

[/(dN/dz)]2 


(9) 


where dA is the angular diameter distance, x{z) is the 
derivative of proper distance with redshift. 


v^r( V) 

r(7/2), 


( 10 ) 


and dN/dz is the redshift selection function, which in case 
of our study is derived empirically from SED fitting (see 

Fig.lSD. 


3.5. Galaxy bias 

In o rder to r elate galax y clusteri n g to d ark matter cluster¬ 
ing, E aiseil (^1984^ and iBardeenI fll986ll introduced a bias 
parameter, a quantity describing the differences between 
the clustering of a galaxy field and the underlying mass 
distribution, i.e., 

b‘^{r,z,M) = ^g(r,z,M)/^rnir,z), (11) 

where ^g{r, z, M) is the correlation function of the inves¬ 
tigated galaxy population, and ^rnir,z) is the correlation 
function of the dark matter. Bias is dependent on scale (r), 
redshift (z) and objects’ mass (M). In terms of the assumed 
fluctuations of mass traced by galaxies (ug) and mass (am), 
the bias parameter should satisfy the relation 

0'(g,R) = bcr(ni,R)- (12) 

It can be presumed that b is invariant as long as the scales 
are large enough. In this case it is customa ry to comput e 
Or for a representative scale R =8 h“^Mpc (lQuadrill2007ll . 


To obtain the value of cr(m_ 8 ) for different cosmic epochs, 
one can use a following formula: 

C^(m. 8 )(z) = O-(m. 8 )( 0 )D(z), (13) 

where cr(m, 8 )( 0 ) is a present day (z = 0 ) mass fluctuation, 
which on the scale R of 8 fe~^M pc has a value of 0.83 
(iPlanck Collaboration et ^l2014ll : and 


D(z) 


g(z) 

g( 0 )(l + z)’ 


(14) 


where g{z) is a normalized growth factor that describes the 
growth rate of the matt er density fluc t uatio ns, and can be 
calculated, according to ICarroll et al.l (|l992ll . as follows: 


9iz) 


^n^{z)[n^\z) - nA{z)+ 


(15) 


The value of the fluctuations of the galaxy density (ug^s) 
with respect to the assumed scale can be obtained by im¬ 
plementation of the correlation function slope 7 and corre¬ 
lation length ro, i.e., 


where J 2 = 72/(2'>'(3 — 7)(4 — 7)(6 — 7 )). 

4. Results 

Following the procedures described in the previous sub¬ 
sections we compute uj{9) for the 24 pm selected galaxy 
catalog. The angular distances were measured in bins of 
Alog = 0.2. We then fitted the data using a technique 
with 2 tT clipping and determined the errors from the covari¬ 
ance matrix. 

The results of the power-law fitting to the correlation 
function between the full catalog (with y^ > 99) and sub¬ 
catalog of the best redshift quality (y^ < 10 ) show, that 
the two samples differ significantly. The full catalog showed 
ro = 4.93 ± 0.49 /i“^Mpc, while the subcatalog showed 
ro = 6.47 ±0.56 h-^Mpc. 

The low ro value for the sample containing all y^ values 
but the failing ones could be a result of the fact that it 
contains galaxies with lower fluxes, which would dilute the 
clustering signal. 

The difference is significant between the fit values for 
the subsamples with y^ smaller then 10 and the full catalog. 
Depending on the y^ cut we see drastic changes in both the 
strength and in the slope of the clustering. 

The most secure galaxies (with redshift estimation on 
the level of y^ < 10 ) exhibit a much larger correlation 
length when compared with the whole sample. The y^ < 10 
galaxies are concentrated around z ^ 0.7 (dash-dotted line 
on Fig. [i). The brightness distribution of those sources re¬ 
veals that this sample consists mostly of the objects with 
S 24 /im > 500 pJy. This result could indicate that galaxies 
with the most secure redshifts are the most luminous ob¬ 
jects within the AKARI sample. In the following analysis, 
we only focus on this subsample of galaxies. We present the 
24 pm flux as a function of redshift for the y^ < 10 sample 
in Fig. [71 
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Fig. 7: 24 /im flux of objects from < 10 sample as a func¬ 
tion of their redshift. The flux limit, marked as a red line, 
was derived from maximum of differential number counts 
of detected sources. 


4.1. High- and low-redshift population sample 

The galaxies with a successfully calculated zcigale on the 
goodness of fit level of < 10 were divided into four red¬ 
shift ranges based on the shape of resultant distribution: 1) 
the low-z sample composed of sources with z < 0.5; 2) the 
low-intermediate sample with 0.5 < z < 0.9, which includes 
the primary and most evident peak of the distribution 
which, as stated before, could be a result of 16.3 pm and/or 
17 pm PAH emission features passing through 24 ^m pass- 
band; 3) the intermediate sample with 0.9 < z < 1.3, in¬ 
cluding the secondary peak attributed to the 12.7 pm PAH 
and/or 12.8 pm [Nell] emission features, which at these 
redshifts would pass through the 24 ^m passbands; and 
4) the high-z sample with z > 1.3, a range, which in¬ 
cludes the possible peak at z ~ 2.4 in which we expect 
a significant incompleteness since sources could fall out of 
the detection limits due to the deep silicate absorption fea¬ 
tures at ^ 9.7 pm. At z ^ 1.5, these features would appear 
in ^24 pm passbands. For the subsamples divided in this 
way, we estimated the angular correlation function. What 
is more, as a by-product of the photometric redshift esti¬ 
mation we were able to obtain the estimates of the Ltir 
for the galaxies in the AKARI 24 pm selected sample. The 
details of the analysis of the physical parameters for the 
sources from AKARI NEP Deep Field using CIGALE will 
be addressed in Malek et. al (in prep) and Buat et al. (in 
prep). Here we use the output for general purposes only. 

Fig. [5] shows the results of this calculation, while the 
Table|5]lists the derived power-law fitting parameters along 
with the corresponding correlation lengths, Ltir, and bias 
(see subsection 4.2) for galaxies in each redshift bin. Table|3] 
summarizes the amount of the catastrophic errors obtained 
within each considered redshift range. 

For the lowest redshift interval (z < 0.5), which pre¬ 
dominantly contains galaxies with total infrared (TIR) lu¬ 
minosities of an order of lO^^L©, the derived correlation 
length is equal to 3.62 ± 0.79/i“^ Mpc. At redshifts 0.5 < 
z < 0.9, we start to observe a LIRG population {Ltir ^ 


lO^^L©), which displays a higher tq: 6.21 ± 0.78/i“^ Mpc 
and mean Ltir^ 1.4 • lO^^L©. 

The higher-z population (0.9 < z < 1.3) displays a 
correlation length (rp = 5.86 ± 0.69 h~^ Mpc) similar to 
that of the lower-z population, however, it displays a sig¬ 
nificantly higher Ltir 7 • lO^^L©. The correlation func¬ 
tion of galaxies in the z > 1.3 has a correlation length 
of 7.23 ± 0.87 Mpc h~^. Even though this sample has the 
largest redshift dispersion, the clustering signal is not a sub¬ 
ject to any substantial dilution. Nevertheless, the amount of 
the catastrophic errors within this bin is high (~ 83%, see 
Table 131); therefore without any additional follow-up mea¬ 
surements of redshifts to confirm our results, conclusions 
based on this bin cannot be reliable. 



Fig. 8: Two-point angular correlation function calculated 
for the subsamples divided according to the derived photo¬ 
metric redshifts with < 10. Squares represent the low z 
sample (z < 0.5), diamonds a sample with 0.5 < z < 0.9, 
circles the sample with 0.9 < z < 1.3, and triangles the 
high z sample with z > 1.3. 


Table 3: Comparison of zcigale vs Zspec as a function of 
^bin - percent of CE in the redshift bin, ptot - percent 
of CE in the total matched sample. 



•Grange 

Ngal 

Ncb 

rjhin [%] 

Vtot [%] 

10 

z < 0.5 

32 

0 

0.00 

0.00 

10 

0.5 < z < 0.9 

5 

1 

20.00 

1.92 

10 

0.9 < z < 1.3 

6 

3 

50.00 

5.77 

10 

1.3 < z 

6 

5 

83.33 

9.62 


4.2. Galaxy bias 

We study the bias b of the 24 ^m selected AKARI galaxies 
as described in subsection 3.5. We leave a more detailed 
study using a full Halo Occupation Distribution (HOD, 
IScoccimarro et aLll200lll formalism for future work. 
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Table 2: Clustering of 24 /im galaxies in AKARI NEP Deep Field. 


X 

2 range 

Ngal 

7 

ro [6 ^Mpc ] 

Ltir 

b 

10 

z < 0.5 

168 

1.99 ±0.06 

3.62 ±0.76 

10.276 ±0.81 

0.89 ±0.20 

10 

0.5 < z < 0.9 

207 

1.69 ±0.03 

6.21 ±0.78 

11.15 ±0.315 

1.72 ±0.25 

10 

0.9 < z < 1.3 

159 

1.57 ±0.10 

5.86 ±0.69 

11.84 ±0.36 

1.91 ±0.21 

10 

z > 1.3 

77 

1.85 ±0.12 

7.23 ±0.87 

12.43 ±0.44 

3.53 ±0.51 


5 

4 

3 

2 

1 t 

0 

0.0 0.5 
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Fig. 9: Linear bias as a function of redshift for AKARI 
photometric redshift samples (filled circles). Dashed curves 
represent the theoretical linear halo bias evolution of dark 
matter halos of minimal masses: 10^°, 10^^, 10^^, and 10^^ 
(from bottom to top). As a reference we s how results from 
the lit erature: triangles mark r esults from Starikova et al.l 
(|2012ll . squares the res ults fromiMagliocchetti et al.l (12001^ . 
cross esthe results f rom iGilli &: PaddiT i 200711 . and asterisks 
from iDollev et aO ()20I4ll . 


Once the bias factors have been computed for all 
AKARI galaxy subsamples (section 3.5), it is possible to 
trace their evolution with redshift. This procedure can give 
an insight into what kinds of galaxies the 24 pm selected 
samples could correspond to at the current ep och: z = 0 
(jNusser fc Davis! 11994 iMoscardini et al.l 11991^ . This ap¬ 
proach is however simplified: it is based on the assump¬ 
tion that galaxies throughout their evolution do not inter¬ 
act with each other and they are only influenced by their 
density field. We can track the evolution by the relation, 
i.e., 


b{z) = 1 + 


m -1 

D{z) 


(17) 


where D(z) is given by eq. [T31 and 6(0) is a value of bias at 
z = 0. 

In Fig, in] we show the l inear bias evolution derived 
from ISheth fc Tormeiil (Il999ll formalism for varying min¬ 
imum dark matter halo (DMH) mass thresholds. The min¬ 
imum masses of DMH for galaxies within redshift inter¬ 
vals of z < 0.5 and 0.9 < z < 1.3 are located between 
IO^^Mq/i"^ <Mh< IO^^Mq/i-i. For 0.5 < z < 0.9 inter¬ 
val, galaxies are expected to have a slightly higher minimum 
mass than other considered samples: IO^^Mq/i”^ > Mh- 


However, since values of bias parameters for the intervals 
of 0.5 < z < 0.9 (6 = 1.72 ± 0.25) and 0.9 < z < 1.3 
(6 = 1.91 ± 0.21) are in agreement within their error bars, 
this result for the lower z sample could be misleading. 
Nevertheless, if it is a true effect, it could indicate that 
IR galaxies with higher Ltir do not necessarily reside in 
more massive DMHs, as in the case of redshift intervals of 
0.9 < z < 1.3 (with Ltir ^ 7 • lO^^L©) and 0.5 < z < 0.9 
(with Lttb ^ 1.4 • lO^^ Lf?^). This idea is consistent with 
the work of iGotol (|2005h . where it has been reported (for 
the IR galaxies observed by IRAS) that more IR lumi¬ 
nous galaxies tend to have smaller local density. They ar¬ 
gue that luminous infrared galaxies with moderate total 
infrared luminosities (lO^^ ^L© < Ltir < IO^^Lq) exist 
in higher density regions than IR galaxies with 10 ^^Lq < 
Ltir < 10^“^Lq. Therefore there must exist a discrepancy 
in the origin of these two potentially different populations. 
If LIRGs are created th rough a merger and/or interaction 
of sev eral galaxies fe.g.. lTaniguchi et al.lll998l iBorne et al.l 
l2000ll . then their local environment would naturally be 
lower. Then, the higher minimal DMH of the AKARIs’ 
0.5 < z < 0.9 could mean that within this interval we still 
observe a substantial fraction of normal spiral galaxies in 
addition to interacting systems. It has been reported that 
up to z ^ 0.5 — 0.7 more than a half of t he LIRG popula¬ 
tion is composed of normal spiral galaxies (|Bell et al.ll2005l : 
[Melbourne et al.l l2005t lElbaz et'Sl 1200711 . Beyond those 
redshifts, mergers start to enhance the Ltir and deplete 
the environment. However, a relatively large tq of AKARIs’ 
galaxies is contradictory to local measurements of cluster¬ 
ing l engths of star-fo r ming spiral galaxies (rn ^ 4 /t~^Mpc ; 
e.g-.. iGoil et al.ll2004 iMeneux et al.l[20?)5 Goil et all 120081 : 
iHeinis et al.l2009ll . On the other hand, AKARI NEP Deep 
survey is flux limited, therefore, at larger redshifts it pref¬ 
erentially and intrinsically detects more luminous galaxies, 
which in turn are more strongly clustered. This could result 
in an increase of the measured rg in this redshift range. This 
issue remains undetermined and a more detailed analysis 
of those galaxies is needed to determine their true nature, 
which is a subject of future research. 

With known 6(z), we can trace the evolution of 'Cg(r=8,z)i 
and therefore the evolution of the ro(z), by inverting Eg. 1111 
This procedure can lead to the determination of a possible 
fate of each considered population. 

The results are shown in Fig.jTUl AKARI 24 pm galaxies 
at Zmed = 0.25 exhibit a steep correlation function slope 
(7 ^ 2) and a correlation length of 3.62 ± 0.76 6,“^Mpc. 
Their evolution with time could result in reaching rg ^ 4 
/i“^Mpc at the present day. Therefore, according to those 
measurements, the majority of the AKARI galaxies at low-z 
are expected to fall into this category. 

The galaxies within redshift ranges of 0.5 < z < 0.9 
and 0.9 < z < 1.3 could evolve into a very strongly clus- 
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tered population of galaxies, with tq ^ 8-8.5 h“^Mpc. In 
the local Universe, early type galaxies with L have 

been me asured to have sim ilar correlation lengths: rg ^ 8 
h“^Mpc (|Guzzo et al.111997^1 . This means that those objects 
could have evolved from the population of LIRGs at red- 
shifts ^ 0.9. However, the slope of the clustering of AKARI 
galaxies (7 ^ 1.65) is lower than that measured for the local 
ellipticals (7 ^ 2). 



0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 
redshift 


Fig. 10: Correlation length of AKARI NEP Deep galaxies 
(red filled circles) compared with the evolutionary tracks as 
expected from the lin ear scenario (d a shed lines). Triangles 
denote results f rom iGilli fc Daddil (l2007fl . squares from 
IStarikova et al.l (|2012ll . crosse s from Magliocchetti et al.l 
(j2008[) . reverse d triangles fro m B rod win et al. ( 200811 . and 
diamonds from iDollev et all (|2014ll . 


4.3. Comparison with previous studies 

Figures [9] and [To] present a comparison of our derived spa¬ 
tial correlation lengths and biases for the redshift divided 
samples with the results from the literature. 

As mentioned in Section 1, several previous studies have 
approached the task of analyzing the 24 pm selected galax¬ 
ies by dividing the samples based on their redshift mea¬ 
surements. A direct comparison with previous studies is 
not a straightforward procedure because of the varying sur¬ 
vey sizes and depths, different methods used to select the 
galaxy samples, and the difference in the considered red¬ 
shift ranges. We present the first non-Spitzer based mea¬ 
surement of clustering of 24-/im selected sources. Table 01 
shows a comparison of the previously investigated surveys 
of 24 pm selected galaxies throughout the literature. 

It is worth noting that the source selection, which we 
have relied upon for this analysis, was based on the rough 
spectral shape of the considered sources. This excludes the 
possibility of mixing AGN dominated galaxies, in contrast 
with samples chosen without this kind of criterion. 

Measurements of the correlation length of local (0.3 < 
z < 0.6) blue galaxies have been reported in several pre- 
vious studies to be on the order of ^ 3.7 /i“^Mpc (e.g., 
ICoil et ^l2004ll . This was found to be in agreement with 
the measurements of the rest-frame selected UV galax¬ 


ies ('see [Milliard et alll2007l iHeinis et aI1l2009ll . Since this 
value is consistent with that measured for the lowest-z 
AKARI galaxy subsample (rg = 3.62 ± 0.76 /i“^Mpc), it 
could mean that it is mainl y composed of the local, norma l 
SF galaxies. The study by iMasci fc SWIRE Team! (1200611 
for 24 pm selected galaxies in SWIRE survey found similar 
values of rg: 3.32±0.19 /i“^Mpc with the steep slope 7^2. 

Eor the sample of ^ 1300 objects (S'24;jm > 20 pJy) 
with an average Ttr ^ 4 ■ lO^^Lgi wi thin the GOODS fields 
with Zmed ^ 0.8. lGilli fc DaddfcOOTl have found the correla¬ 
tion length equal to rg = 4.0±0.4 h“^Mpc. They have found 
that for LIRGs (Lir > lO^^L©) rg rises up to 5.14 ± 0.76 
/i“^Mpc. These values are in agreement with that obtained 
for the LIRGS found in AKA RI NEP-Deep survey . 

The analysis performed bv lMasrliocchetti et al.ll2?)?)^ for 
1041 galaxies brighter than S'24^m = 400 pJy resulted in 
obtaining the correlation lengths for low-z (zmean 0.8) 
and high-z (zmean 2) sources to be equal to rg = 5.95lit;2e 
/i“^Mpc and rg = 11.13l2'38 ^~^Mpc, respectively. The 
value obtained for the low-z sample, z = [0.6 — 1.2], is 
comparable with that derived in this work (rg = 5.86±0.69 
/i“^Mpc) for 0.9 < z < 1.3 range. This evidence for both 
studies at these redshifts points to the fact that they trace 
a population of galaxies that could evolve into present day 
rg ^ 8 /i“^Mpc ob jects (possibly elliptic als with L ^ L*). 

The study by IStarikova et al.l 120121 based on Spitzer 
Wide-Area Survey within the SWIRE Lockman Hole field 
for more than 20000 objects brighter than S'24;im > 310 ^Jy, 
has shown that the correlation length for the low- (zmean ^ 
0.7) and high-z (Zmean 1-7) samples are rg = 4.98 ± 
0.28 Mpc h“^ and rg =8.04 ± 0.69 Mpc h“^, respectively. 
Our derived values for 0.5 < z < 0.9 are slightly higher 
(rg = 6.21 ± 0.78 /i“^Mpc). The divergence of the results 
could arise because the widths of redshift intervals that are 
being compared are different. The low-redshift interval of 
Starikova et al. covers a range from ^ 0 to 2, with the mean 
z ^ 0.7. In AKARI’s range for Zmedian ^ 0.7, we have not 
included the galaxies at z lower than 0±; _ 

Eurthermore, the recent study by iDollev et al.l (|2014f) 
from the Spitzer observations of the Bootes field of over 
22 000 24 pm galaxies up to z ^ 1 shows correlation lengths 
varying with redshift, from rg ^ 3 to rg ^ 6 /i“^Mpc. 
Subsamples at zmedian ^ 0.3 with rg = 3.18±0.44 h“^Mpc 
and Zmedian ^ 0.9 with rg = 5.86 ± 0.66 /i“^Mpc show 
a good agreement with our results at similar median red- 
shifts: rg = 3.62 ± 0.76 /i“^Mpc at Zmedian ^ 0.25 and 
rg = 6.21 ± 0.78 h-^MpC at Zmedian ^ 0.7. 

It is also worth noting that the subsample of AKARI 
galaxies with the highest z indicates a correlation length 
value (rg = 7.23 ± 0.87 /i~^Mp c ), w hich is consistent 
with the work of iBrodwin et al.l (|2008ll . Their investiga¬ 
tion was focused on clustering analysis of ^ 2600 dust ob¬ 
scured galaxies (DOG) with spectroscopic redshifts located 
at 1.5 < z < 2.5. A detection of strong clustering with 
rg = 7.4lJ;g4 /i“^Mpc for a 824,^111 > 300 ^Jy flux cut was 
reported. However, these values from the AKARI analysis 
were derived from small number statistics, and therefore, 
these results should be treated with caution. 


5. Summary and conclusions 

We have presented a clustering analysis of the 824 .^^ se¬ 
lected AKARI NEP Deep field galaxies down to a flux limit 
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Table 4: Summary and comparison of the previous 24 /rm selected galaxy surveys. The work of Gilli & Daddi (2007) 
is based on spectroscopic measurements in contrast with other studies (based on photometric measurements). The 
parameter p stands for surface density. 



depth [/rJy] 

p [Ngai/deg^] 

size [deg^] 

probed z 

7 

ro[ h ^Mpc] 

Gilli & Daddi 2007 
Magliocchetti et al. 2008 
Starikova et al. 2012 
Dolley et al. 2014 

20 

400 

310 

223 

13060 

1490 

410 

2678 

0.1 

0.7 

49 

8.42 

~ 1 

[0.6-1.2] & < 1.6 
~ 0.7 & ~ 1.7 

0.2 < z < 1.0 

1.5 ±0.1 

1.8 

1.8 - 2.13 

4.0 ±0.4 

5.95lji°& 11.13t2;03 
4.98 ±0.28&8.04± 0.69 
3-6 

AKARI 

150 

3350 

0.4 

^T3) 

1.79 ± 0.04 

4.93 ± 0.49 


of 150 /rJy. This is the first study of these kinds of sources 
that are not based on Spitzer telescope measurements and 
features a source selection that does not rely on any auxil¬ 
iary observations from other wavelength intervals. 

After performing the sample selection through the use 
of the SVM classifiers trained on the infrared color infor¬ 
mation, we used the CIGALE code to estimate the photo¬ 
metric redshifts for the chosen galaxies, and we were able 
to establish a redshift estimate for 96.11 % of the sample. 
The resultant redshift distribution revealed three peaks: the 
primary one located at z ^ 0.6, which can be attributed 
to either the evolution in the luminosity function or to 
the increase of the detection volume; a secondary peak lo¬ 
cated at z ^ 1.2 possibly related to the 12.7 and 12.8 ^m 
PAH emission lines passing through the 24 pm passband at 
these redshifts; and an indication of a possible third peak at 
z ~ 2.4. Even though AKARl’s high-z peak is too tentative 
to draw any firm conclusions about any possible popula¬ 
tion of galaxies, several models predict its existence and 
attribute it to the sources dominated either by AGNs or by 
a PAH emission feature at 8 ^m. 

For the calibration of those measurements, we have com¬ 
pared them with the spectroscopic redshifts for counter¬ 
parts available from Murata et al. catalog (149 objects) 
and estimated that, depending on the goodness of fit value, 
we encounter a 18.75 % rate of CE for < 10 and 35.22 % 
for < 70. Therefore, the following clustering analysis was 
performed only on a subsample of galaxies with the most se¬ 
cure redshift estimation. To obtain the most reliable results, 
we substituted the photometric redshift measurements for 
the objects possessing the spectroscopic redshift measure¬ 
ments. With a set of galaxies prepared in the way described 
above, we examined the clustering properties of the derived 
samples with respect to their redshift distribution. 

Using a power-law approximation to derive correlation 
function together with the estimated photometric redshift 
information, we obtained the spatial correlation length rg. 
For the full sample, the derived value equals tq = 4.93±0.49 
h“^Mpc, which is consistent with the previous studies con¬ 
ducted for the same wavelengths. For the specific redshift 
intervals we have found the following facts: 

1. For z < 0.5 ro = 3.62±0.79 h-^ Mpc, Ltir 10^°Lq. 
This indicates that this redshift interval is mostly com¬ 
posed of normal SF galaxies. Their bias parameter is 
equal to 0.93 ± 0.11 and their estimated minimal DMH 
mass is Mh > 10 ^^Mq. 

2. For 0.5 < z < 0.9 and 0.9 < z < 1.3: 

— correlation lengths are 6.21 ± 0.78 h~^ Mpc and 
5.86 ± 0.69 h~^ Mpc, respectively ; 


— galaxies exhibit very similar, relatively high val¬ 
ues of bias parameter (^ 1.7 — 1.9). Those values 
ar e consistent wi t h the linear bias derived in work 
of iLagache et al.l l)2007f) . which was measured from 
Cosmic Far-Infrared Background anisotropies at 160 
pm {b = 1.74±0.16). Both measurements probe sim¬ 
ilar redshift range {zRagache ^ !)• 

— the sample at 0.9 < z < 1.3, composed of brighter 
galaxies {Ltir ^ lO^^ ^'^L©), seems to be residing 
in lower minimal mass DMH {Mh < IO^^Mq/i”^) 
than the lower luminosity (with Ltir ^ lO^^'^^L© 
and Mh > IO^^Mq/i”^) galaxies at 0.5 < z < 0.9. 
This could indicate that brighter infrared galaxies 
do not necessarily reside in more massive halos. This 
means that despite similar clustering properties, we 
are dealing with two different populations of star¬ 
forming galaxies. The redshift distribution shows 
two distinct peaks (at ^ 0.6 and ~ 1.2): the pri¬ 
mary peak in the redshift distribution at ^ 0.6 could 
contain a mix of both normal star-forming galax¬ 
ies and LIRGs. The dip in the distribution between 
the peaks at ^ 0.9 could mean that normal galax¬ 
ies slowly fade out from the field of view to reveal a 
more prominent LIRG population at redshifts ^ 0.9 
and higher. 

— Extrapolating the AKARI data to the present day 
epoch has allowed us to investigate the approximate 
descendants of each galaxy subsample. Working un¬ 
der the assumption that galaxy interactions do not 
affect the am value at scales as large as 8 Mpc, 24 ^m 
galaxies at redshifts 0.5 < z < 1.3 could evolve into 
current galaxies with rg ^ 8 /i“^Mpc, a value mea¬ 
sured for ellipticals with L ^ L^. 

3. Galaxies at z > 1.3 display a strong clustering signal at 
small scales despite the large size of the redshift range. 
This could indicate an existence of separate, highly clus¬ 
tered population(s). However, because of small number 
statistics burdened with a large number of GE, these 
results remain unconfirmed. 
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